Multichannel Speech Enhancement Based on Generalized Gamma Prior Distribution with Its Online Adaptive Estimation
نویسندگان
چکیده
We present a multichannel speech enhancement method based on MAP speech spectral magnitude estimation using a generalized gamma model of speech prior distribution, where the model parameters are adapted from actual noisy speech in a frame-by-frame manner. The utilization of a more general prior distribution with its online adaptive estimation is shown to be effective for speech spectral estimation in noisy environments. Furthermore, the multi-channel information in terms of crosschannel statistics are shown to be useful to better adapt the prior distribution parameters to the actual observation, resulting in better performance of speech enhancement algorithm. We tested the proposed algorithm in an in-car speech database and obtained significant improvements of the speech recognition performance, particularly under non-stationary noise conditions such as music, air-conditioner and open window. key words: multi-channel speech enhancement, speech recognition, generalized gamma distribution, moment matching
منابع مشابه
Gamma Modeling of Speech Power and Its On-Line Estimation for Statistical Speech Enhancement
This study shows the effectiveness of using gamma distribution in the speech power domain as a more general prior distribution for the model-based speech enhancement approaches. This model is a superset of the conventional Gaussian model of the complex spectrum and provides more accurate prior modeling when the optimal parameters are estimated. We develop a method to adapt the modeled distribut...
متن کاملProceedings of Meetings on Acoustics
Estimation of the power spectral density (PSD) of noise is crucial for retrieving speech in a noisy environment. 3 novel methods for estimating the non-white noise PSD of noisy speech based on a generalized gamma distribution and 3 criterions are proposed, which are minimum mean square error (MMSE), maximum a posteriori (MAP) and Maximum likelihood estimation (MLE). Because of the highly non-st...
متن کاملNoise Power Spectral Density Estimation based on Maximum a Posteriori and Generalized Gamma Distribution
Noise power spectral density (PSD) estimation is a crucial part of speech enhancement system due to its contributory effect on the quality of the noise reduced speech. A novel estimation method for color noise PSD on the basis of an assumption of generalized Gamma distribution and maximum a posteriori (MAP) criterion is proposed. In the experiment, generalized Gamma PDF which is a natural exten...
متن کاملMMSE estimation of complex-valued discrete Fourier coefficients with generalized gamma priors
We consider DFT based techniques for single-channel speech enhancement. Specifically, we derive minimum mean-square error estimators of clean speech DFT coefficients based on generalized gamma prior probability density functions. Our estimators contain as special cases the well-known Wiener estimator and the more recently derived estimators based on Laplacian and twosided gamma priors. Simulati...
متن کاملWeighted Log-spectral Amplitude Estimation with Generalized Gamma Distribution under Speech Presence Probability
In this paper, we propose a speech enhancement approach. The approach is based on deriving weighted log-spectral amplitude estimator that exploits the generalized Gamma distributed speech priors under speech presence probability. The log-spectral amplitude estimator is weighted by psychoacoustically motivated speech distortion measure to take advantage of the perceptual interpretation. The expe...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- IEICE Transactions
دوره 91-D شماره
صفحات -
تاریخ انتشار 2008